A Study of an Indirect Reward on Multi-agent Environments
Abstract
In multi-agent learning, where multiple agents learn at the same time, the indirect reward problem arises: how should a reward be distributed to an agent that did not obtain it directly? We have previously shown a theorem [3] about the “negative effect” of an indirect reward. This paper focuses on the “positive effect” of an indirect reward, such as the elimination of the perceptual aliasing problem [1]. First, we describe the relationship between the theorem [3] and the “positive effect” of the indirect reward. Next, we propose a method to eliminate the perceptual aliasing problem and show its effectiveness through numerical examples.
Similar resources
Intelligent multi-agent modeling of the interbank network and evaluation of the impact of regulatory policies
Agent-based modeling is an emerging computational technique that makes it possible to simulate complex economic systems, including the banking network, with a bottom-up approach. In this paper, the country's banking network is simulated with an intelligent multi-agent model in which the agents behave based on adaptive learning. This modeling has been done with the aim o...
Improving Agent Performance for Multi-Resource Negotiation Using Learning Automata and Case-Based Reasoning
In electronic commerce markets, agents often must acquire multiple resources to fulfil a high-level task. To obtain such resources, they need to compete with each other. In multi-agent environments that involve competition, negotiation is an interaction between agents aimed at reaching an agreement on resource allocation and coordinating with each other. In recent ...
Voltage Coordination of FACTS Devices in Power Systems Using RL-Based Multi-Agent Systems
This paper describes how multi-agent system technology can be used as the underpinning platform for voltage control in power systems. In this study, some FACTS (flexible AC transmission systems) devices are properly designed to coordinate their decisions and actions in order to provide a coordinated secondary voltage control mechanism based on multi-agent theory. Each device here is modeled as ...
A Model for the Effects of Ethical Work Climate, Organizational Trust and Proactive Customer Services Performance: The Role of Perceived Politicizing in Organization’s Reward System
This research aims to investigate the effects of ethical work climate on organizational trust and proactive customer service performance while the mediating role of perceived politicizing in organization’s reward system is considered as well. Statistical population consisted of all employees of Pasargad Insurance Company. By applying random sampling method, 260 employees were selected. Data wer...
Efficient Reward Functions for Adaptive Multi-rover Systems
This paper addresses how efficient reward methods can be applied to multiple agents co-evolving in noisy and changing environments, under communication limitations. This problem is approached by “factoring” a global reward over all agents into agent-specific rewards that have two key properties: 1) agents maximizing their agent-specific rewards will tend to maximize the global reward, 2) an agen...
Publication date: 2016